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3 <110> APPLICANT: E . I. du Pont de Nemours and Company 

5 <120> TITLE OF INVENTION: Chorismate Biosynthesis Enzymes 

7 <130> FILE REFERENCE: BB-1159 

C--> 9 <140> CURRENT APPLICATION NUMBER: US/0 9/744,0 99 

C--> 10 <141> CURRENT FILING DATE: 2001-06-27 

12 <150> PRIOR APPLICATION NUMBER: 60/093,611 

W--> 13 <151> PRIOR FILING DATE: July 21, 1998 

15 <160> NUMBER OF SEQ ID NOS : 9 

17 <170> SOFTWARE: Microsoft Office 97 

19 <210> SEQ ID NO: 1 

20 <211> LENGTH: 1628 
.21 <212> TYPE: DNA 

22 <213> ORGANISM: Zea mays 

-24 <400> SEQUENCE: 1 

25 ccgccgcttt ccgagctctc atggcggcgt ccgcgtcctc cctccttgcc 

-26 cctcatcctg tggggccatc tccccccagc tccctcgtgg agctccggcc 

127 tcgcttcgcc gtcgcgtcac tcctgctacc tcctccgcgc ttccccttcg 

28 ggagtcgctt cgtagccaac gccgctccca ccatgcagcc tccggcggag 

,29 ccacggtagt cgatgtcgac ctcggcgacc gtagctatcc gatctacatc 

30 tcctcgacga gccggacctg ctgcagaggc atgttcatgg taagagggtt 



31 ccaacacgac 

32 acctgaatgt 

33 acacgctgat 
"34 ttgtagcact 

35 tccggggcgt 

36 ttggtgggaa 

37 agccacaatg 

38 cttcaggcat 

39 ggcaagagaa 

40 tcaagagatc 

41 ttcgagcaac 

42 atggagcatg gctccatggg 

43 ctcaccgcct ggggtggata 

44 aagccaaact 

45 tggctgttga 

46 tagggtgctg 

47 cattctgcga 

48 ctatgtatca 
4 9 ttttttgtca 

50 tggctaatca 

51 cccttctgtc 

52 aaaaaaaa 

54 <210> SEQ ID NO: 2 

55 <211> LENGTH: 438 

56 <212> TYPE: PRT 

57 <213> ORGANISM: Zea 
59 <400> SEQUENCE: 2 



cgtcgcgccg ctttacctgg 
atcagtggaa agcgtgatcc 
gaaggtgttt gacaaggcag 
gggtggtggt gtcattgggg 



caatttcata 
aaccgggatt 
tgttctaatt 
tgccgaggta 
gaacatgccg 
atgtgaaaac 
actaaacctg 



tcccattgca 
caagaaggtt 
tgtatttacg 
caactgagac 
atcagagaaa 
gaaataatgg 
actgcggtca 
tgttactgtc 



cagataccaa 
aaccacccac 
gacacaaata 
gtaaagtatg 
aaattgttag 
aaagctgaag 
ggtcacacat 
gaggctgtcg 
gatgactcca 
cctcctgaga 
gctgatggtc 
ggggattatg 
atcccagttt 
atattgtggt 
tatattgttt 
gttgttctct 
aagtaacaat 



acaaggtgac atgggcactc 
tgcccgacgg tgaaaagtac 
tcgagtcccg ttttgaccgc 
acatgtgtgg ttttgcagct 
ctactctgat ggcccaggtg 
taggaaagaa cttgattggg 
cacttaacac attgcctgac 
ggctcataag ggatgcacca 



caagagaacc 
tggtggcaca 
ttggccatgc 
cagctggaac 
tcagaaaacg 
ccatgaccgt 
tgttgagact 
acgggaatgc 
tggacatcac 
gtacattatc 
taagtgctgg 
agtaataata 
actcgtagag 



aaatgctttg 
agatgagaag 
tattgagact 
agttatggca 
tgtggttgac 
agagaagttt 
catccttctg 
actcgatgaa 
ttctgtatgc 



gcccccgcat 
gctgcctccg 
aggcgccatc 
tccagggtct 
ggcgcaggcc 
ctggtggtga 
acccacaaca 
aaaaatatgg 
cggtgcacat 
gctgcattcc 
gattcatctg 
gcattctacc 
agggagctag 
ttctttgagt 
gcatatgcta 
gaaagtggcc 
gggacaggat 
actgacatgt 
atactaaagc 
aaaaacatca 
aaaggaccgc 
accctacatg 
tagtactagg 



ttcactgcct cattggtata 
tttgagtttg tagagaacca 
gtaaagaagc ttcactacat 
atttaaacca aaaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1628 



mays 
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60 


Met 


Ala 


Ala 


Ser 


Ala 


Ser 


Ser 


Leu 


Leu 


Ala 


Ala 


Pro 


Ala 


Ser 


Ser 


Ser 


61 


1 








5 










10 










15 




63 


Cys 


Gly Ala 


He 


Ser 


Pro 


Gin 


Leu 


Pro 


Arg 


Gly 


Ala 


Pro 


Ala 


Ala 


Ala 


64 








20 










25 










30 






66 


Ser 


Val 


Ala 


Ser 


Pro 


Ser 


Arg 


His 


Ser 


Cys 


Tyr 


Leu 


Leu 


Arg 


Ala 


Ser 


67 






35 










40 










45 








69 


Pro 


Ser 


Arg 


Arg 


His 


Arg 


Ser 


Arg 


Phe 


Val 


Ala 


Asn 


Ala 


Ala 


Pro 


Thr 


70 




50 










55 










60 










72 


Met 


Gin 


Pro 


Pro 


Ala 


Glu 


Ser 


Arg 


Val 


Ser 


Thr 


Val 


Val 


Asp 


Val 


Asp 


73 


65 










70 










75 










80 


75 


Leu 


Gly 


Asp 


Arg 


Ser 


Tyr 


Pro 


He 


Tyr 


He 


Gly 


Ala 


Gly 


Leu 


Leu 


Asp 


76 










85 










90 










95 




78 


Glu 


Pro 


Asp 


Leu 


Leu 


Gin 


Arg 


His 


Val 


His 


Gly 


Lys 


Arg 


Val 


Leu 


Val 


79 








100 










105 










110 






81 


Val 


Thr 


Asn 


Thr 


Thr 


Val 


Ala 


Pro 


Leu 


Tyr 


Leu 


Asp 


Lys 


Val 


Thr 


Trp 


82 






115 










120 










125 








.£4 


Ala 


Leu 


Thr 


His 


Asn 


Asn 


Leu 


Asn 


Val 


Ser 


Val 


Glu 


Ser 


Val 


He 


Leu 


"85 




130 










135 










140 










i8 7 


Pro 


Asp 


Gly 


Glu 


Lys 


Tyr 


Lys 


Asn 


Met 


Asp 


Thr 


Leu 


Met 


Lys 


Val 


Phe 


88 


145 










150 










155 










160 


So 


Asp 


Lys 


Ala 


Val 


Glu 


Ser 


Arg 


Phe 


Asp 


Arg 


Arg 


Cys 


Thr 


Phe 


Val 


Ala 


#i 










165 










170 










175 




93 


Leu 


Gly Gly 


Gly 


Val 


He 


Gly 


Asp 


Met 


Cys 


Gly 


Phe 


Ala 


Ala 


Ala 


Ala 


i94 








180 










185 










190 






96 


Phe 


Leu 


Arg 


Gly 


Val 


Asn 


Phe 


He 


Gin 


He 


Pro 


Thr 


Thr 


Leu 


Met 


Ala 


97 






195 










200 










205 










Gin 


Val 


Asp 


Ser 


Ser 


Val 


Gly 


Gly 


Lys 


Thr 


Gly 


He 


Asn 


His 


Pro 


Leu 


10 


0 


210 








215 








220 









"102 Gly Lys Asn Leu He Gly Ala Phe Tyr Gin Pro Gin Cys Val Leu He 

103 225 230 235 240 

-105 Asp Thr Asn Thr Leu Asn Thr Leu Pro Asp Arg Glu Leu Ala Ser Gly 

il06 245 250 255 



108 He Ala Glu Val Val Lys Tyr Gly Leu He Arg Asp Ala Pro Phe Phe 

109 260 265 270 

111 Glu Trp Gin Glu Lys Asn Met Pro Lys Leu Leu Ala Arg Glu Pro Asn 

112 275 280 285 

114 Ala Leu Ala Tyr Ala lie Lys Arg Ser Cys Glu Asn Lys Ala Glu Val 

115 290 295 300 

117 Val Ala Gin Asp Glu Lys Glu Ser Gly Leu Arg Ala Thr Leu Asn Leu 

118 305 310 315 320 

120 Gly His Thr Phe Gly His Ala He Glu Thr Gly Thr Gly Tyr Gly Ala 

121 325 330 335 

123 Trp Leu His Gly Glu Ala Val Ala Ala Gly Thr Val Met Ala Thr Asp 

124 340 345 350 

126 Met Ser His Arg Leu Gly Trp He Asp Asp Ser He Arg Lys Arg Val 

127 355 360 365 

129 Val Asp He Leu Lys Gin Ala Lys Leu Pro He Ala Pro Pro Glu Thr 

130 370 375 380 

132 Met Thr Val Glu Lys Phe Lys Asn He Met Ala Val Asp Lys Lys Val 
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133 385 390 395 

135 Ala Asp Gly Leu Leu Arg Leu lie Leu Leu Lys 

136 405 410 

138 Cys Val Phe Thr Gly Asp Tyr Asp Gly Asn Ala 

139 420 425 

141 His Ala Phe Cys Asp Asn 

142 435 

144 <210> SEQ ID NO: 3 

145 <211> LENGTH: 1358 

146 <212> TYPE: DNA 

147 <213> ORGANISM: Oryza sativa 

149 <400> SEQUENCE: 3 

150 gcacgagtac agcgtcgtct tcgtttcgca gctccaccac 

151 cctcctctct gctcgccgcc gcctcgtcgt cctcccgcgc 

152 gcgcccccag cgcgagtccg gccgcggcgg cctccctccc 

153 gcgctcctcc cctccgcgct tccgcagcga ggaccctccg 
,154 cggcccccgc tatgcagccg ccgccggcgt cgagggtctc 
155 tcggcgaccg gagctacccg atctacatcg gcgcaggcct 
-15 6 tgcagaggca tgttcatggg aagagggttt tggtggtgac 
157 tctacctgga gaaggtgacc tgggcactca cgcacaacaa 
-158 gcgtgatcct gcccgacggc gagaagtaca aggacatggg 
^159 acaaggcagt cgagtcccgg ctggaccggc ggtgcacgtt 
"160 ttattgggga catgtgcggt tttgcagctg ctgcgttcct 

1 6 1 agattcctac tactctgatg gcccaggtgg attcatctgt 
; 162 accatccatt ggggaagaac ttaattgggg cgttctacca 

163 acaccgagac actgaataca ttgcctgaca gggaactggc 

164 tgaagtatgg tctcataaga gatgcaccgt tctttgaatg 

165 cattattagc aagagaacca agtgctctgg cctatgctat 

166 aagctgaagt ggttgctcag gacgagaagg aaagtggtct 

167 gccatacatt tggccatgct atagaaacag gaactggcta 
16 8 aggctgttgc agctggaaca gttatggcag ctgacatgtc 

■1169 acgagtcaat caagaaacgg gcaattgaca tactagagaa 
-170 ctccagaggc catgacagtg gagaagttca aaagtattat 

171 ctgatggatt gctgaggctc atcctcctga aaggacctct 

172 gcgattactg ttcttcgagg tcgacgtgtc gataaatt 

174 <210> SEQ ID NO: 4 

175 <211> LENGTH: 436 

176 <212> TYPE: PRT 

177 <213> ORGANISM: Oryza sativa 

179 <400> SEQUENCE: 4 

180 Met Ala Ala Ala Ala Ser Ser Ser Leu Leu Ala 

181 15 10 

183 Ser Arg Ala Ala Ala Val Ser Ala Arg Arg Ala 

184 20 25 

186 Ala Ala Ala Ala Ser Leu Pro Ser Pro Ser Arg 

187 35 40 

189 Pro Leu Arg Ala Ser Ala Ala Arg Thr Leu Arg 

190 50 55 

192 Ser Ala Ala Pro Ala Met Gin Pro Pro Pro Ala 



400 

Gly Pro Leu Gly Cys 
415 

Leu Asp Glu Thr Leu 
430 



ctccatggcg 
ggcggccgtg 
ctcgccgtct 
cagccgtgtc 
cacggttgtc 
tctcgacgag 
caacaccacc 
cccgaatgtt 
cacactgatg 
tgttgcgttg 
gcgtggtgtc 
tggagggaag 
cccacagtgt 
ttcaggcata 
gcaagagaaa 
taagagatcg 
ccgagcaaca 
tggagcatgg 
tcaccgcctg 
agcgaagctt 
ggccgttgat 
gggaagctgt 



gccgccgcct 
tccgcccgcc 
cgcgcctcct 
gtcgcgagcg 
gacgtcgacc 
cctgacctgc 
gtcgcgccgc 
tctgtggaga 
aaggttttcg 
ggaggtggcg 
aatttcatac 
accggcatta 
gtactgatag 
gctgaggtgg 
aacatgccag 
tgtgaaaaca 
cttaatctcg 
ctccatgggg 
ggttggatag 
ccaattacac 
aagaaggttg 
gttttcactg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1358 



Ala Ala Ser Ser Ser 
15 

Pro Ser Ala Ser Pro 

30 

Ala Ser Cys Ala Pro 
45 

Ser Arg Val Val Ala 
60 

Ser Arg Val Ser Thr 
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193 


65 










70 










75 










80 


195 


Val 


Val 


Asp 


Val 


Asp 


Leu 


Gly Asp 


Arg 


Ser 


Tyr 


Pro 


He 


Tyr 


He 


Gly 


196 










85 










90 










95 




198 


Ala 


Gly 


Leu 


Leu 


Asp 


Glu 


Pro 


Asp 


Leu 


Leu 


Gin 


Arg 


His 


Val 


His 


Gly 


199 








100 










105 










110 






201 


Lys 


Arg 


Val 


Leu 


Val 


Val 


Thr 


Asn 


Thr 


Thr 


Val 


Ala 


Pro 


Leu 


Tyr 


Leu 


202 






115 










120 










125 








204 


Glu 


Lys 


Val 


Thr 


Trp 


Ala 


Leu 


Thr 


His 


Asn 


Asn 


Pro 


Asn 


Val 


Ser 


Val 


205 




130 










135 










140 










207 


Glu 


Ser 


Val 


He 


Leu 


Pro 


Asp 


Gly 


Glu 


Lys 


Tyr 


Lys 


Asp 


Met 


Gly 


Thr 


208 


145 










150 










155 










160 


210 


Leu 


Met 


Lys 


Val 


Phe 


Asp 


Lys 


Ala 


Val 


Glu 


Ser 


Arg 


Leu 


Asp 


Arg 


Arg 


211 










165 










170 










175 




213 


Cys 


Thr 


Phe 


Val 


Ala 


Leu 


Gly 


Gly 


Gly 


Val 


He 


Gly 


Asp 


Met 


Cys 


Gly 


214 








180 










185 










190 






216 


Phe 


Ala 


Ala 


Ala 


Ala 


Phe 


Leu 


Arg 


Gly 


Val 


Asn 


Phe 


He 


Gin 


He 


Pro 


•2 1 7 






195 










200 










205 








"219 


Thr 


Thr 


Leu 


Met 


Ala 


Gin 


Val 


Asp 


Ser 


Ser 


Val 


Gly 


Gly 


Lys 


Thr 


Gly 


220 




210 










215 










220 










222 


lie 


Asn 


His 


Pro 


Leu 


Gly 


Lys 


Asn 


Leu 


He 


Gly 


Ala 


Phe 


Tyr 


His 


Pro 


-223 


225 










230 










235 










240 


225 


Gin 


Cys 


Val 


Leu 


He 


Asp 


Thr 


Glu 


Thr 


Leu 


Asn 


Thr 


Leu 


Pro 


Asp 


Arg 


2 2 6 










245 










250 










255 




122 8 


Glu 


Leu 


Ala 


Ser 


Gly 


He 


Ala 


Glu 


Val 


Val 


Lys 


Tyr 


Gly 


Leu 


He 


Arg 


229 








260 










265 










270 






231 


Asp 


Ala 


Pro 


Phe 


Phe 


Glu 


Trp 


Gin 


Glu 


Lys 


Asn 


Met 


Pro 


Ala 


Leu 


Leu 


232 






275 










280 










285 








23 4 


Ala 


Arg 


Glu 


Pro 


Ser 


Ala 


Leu 


Ala 


Tyr 


Ala 


He 


Lys 


Arg 


Ser 


Cys 


Glu 


23 5 




290 










295 










300 










.237 


Asn 


Lys 


Ala 


Glu 


Val 


Val 


Ala 


Gin 


Asp 


Glu 


Lys 


Glu 


Ser 


Gly 


Leu 


Arg 


■ 23 8 


305 










310 










315 










320 


-24 0 


Ala 


Thr 


Leu 


Asn 


Leu 


Gly 


His 


Thr 


Phe 


Gly 


His 


Ala 


He 


Glu 


Thr 


Gly 


-241 










325 










330 










335 




243 


Thr 


Gly 


Tyr 


Gly 


Ala 


Trp 


Leu 


His 


Gly 


Glu 


Ala 


Val 


Ala 


Ala 


Gly 


Thr 


244 








340 










345 










350 






246 


Val 


Met 


Ala 


Ala 


Asp 


Met 


Ser 


His 


Arg 


Leu 


Gly 


Trp 


He 


Asp 


Glu 


Ser 


247 






355 










360 










365 








249 


He 


Lys 


Lys 


Arg 


Ala 


He 


Asp 


He 


Leu 


Glu 


Lys 


Ala 


Lys 


Leu 


Pro 


He 


250 




370 










375 










380 










252 


Thr 


Pro 


Pro 


Glu 


Ala 


Met 


Thr 


Val 


Glu 


Lys 


Phe 


Lys 


Ser 


He 


Met 


Ala 


253 


385 










390 










395 










400 


255 


Val 


Asp 


Lys 


Lys 


Val 


Ala 


Asp 


Gly 


Leu 


Leu 


Arg 


Leu 


He 


Leu 


Leu 


Lys 


256 










405 










410 










415 




258 


Gly 


Pro 


Leu 


Gly 


Ser 


Cys 


Val 


Phe 


Thr 


Gly 


Asp 


Tyr 


Cys 


Ser 


Ser 


Arg 


259 








420 










425 










430 






261 


Ser 


Thr 


Cys 


Arg 


























262 






435 




























264 


<210> SEQ ID NO 


5 
























265 


<211> LENGTH: 1643 
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66 <212> TYPE: DNA 

67 <213> ORGANISM: Glycine max 

69 <400> SEQUENCE: 5 

70 gcacgagccc ggagtggtgt ttgtgttcaa ggaagacagg 

71 gttattgtta ttgttggctt cttcactcag caaccaccaa 

72 ccatcgctga ttcccccaat ggcttccact gccaccaatt 

73 aaccaacaaa ctccaatccc caaaccctct ttcttctcca 

74 aactctaata ataattgggc ctgggcctct gtttccacct 

75 gccacctcct ctcaagttat ggatccctcc gcagcaaaat 

76 atcgtcgaag tcgatttggg tagccggagc tatcctattt 

77 aaccaacccg actatctcca gaggcatgtg catggaaaga 

78 gaaaccgttg cgccacttta tctagacaag gttgttgatg 

79 aatgtttctg tggagagtgt aattttacct gatggtgagc 

80 cttatgaaag tctttgacaa ggccatcgag tcgcggctcg 
gctcttggtg gtggtgtgat tggcgacatg tgtggctttg 

82 ggtgttaatt ttattcagat tcctacaact gtgatggcac 

83 ggaaaaactg ggataaatca ccgccttggg aagaacatga 

84 cagtgtgtgc ttatagacac agacacatta aatacgctac 

85 gggctagcag aggttataaa gtatgggctc attagggatg 

86 gagaaaaata tgcacttatt attggcaaga gatcctagtg 
287 cgatcttgtg aaaacaaggc tgaggttgtg tccttagatc 
"288 gcaacattga acttgggtca tacatttggt catgcaatag 
.2 
,2 
.2 

2 

2 



2 
2 
2 
281 



89 cagtggcttc atggagaggc tgttgcagct ggcacggtaa 

90 cgcctaggtt ggattgatga ttctcttgtg aaacgagttg 

91 aagttaccca cagcccctcc tgagaccgtg actgtggaca 

92 gtggataaga aggtagcaga tgggttgcta aggcttatcc 

93 aattgtgttt tcacagggga ttatgacaga aaggctctag 

94 tgtaaatcct gatctttgtg ctccatgttt ctattgcagc 
295 gagtttttgt atgttgttat ataacctcac ttgtattttt 
296 
297 
299 
300 



accatggttc ccattatata cgcaagagta tattatttga 
ttattaaaaa aaaaaaaaaa aaa 
<210> SEQ ID NO: 6 
<211> LENGTH: 437 



cgcaggcaca 
accacaccgt 
tctctctttc 
ataacaacca 
ctcgcaagtc 
ccgaacccgc 
acatcggatc 
gagtcctagt 
ctttgacaag 
agtacaagga 
accggcggtg 
ctgcctctgc 
aggtcgattc 
tcggtacctt 
cggataggga 
cagagttttt 
taatggcata 
agaaggaaag 
aaactggggt 
tggctgttga 
gagacatttt 
tgtttaaatc 
ttctaaaggg 
acaatacgct 
tttatccaat 
cttatgtact 
aattgaggca 



ggcacatcat 
cgcacccatt 
tctctgcgcc 
tttgcacttc 
aaggatatgc 
tcttcccacc 
cgggttacta 
tgtaactaac 
gggaaacccg 
catggatact 
tacatttgtt 
cttcctacgt 
ttcagttggt 
ttaccaacct 
actggcatca 
tgagtggcaa 
tgctataaag 
tggactgagg 
aggctatggg 
catgtcatat 
aaaacaggct 
tgtcatggca 
tcctctaggc 
tcgtgcattc 
tccttttagt 
ttggatggca 
tgatcaataa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1643 



301 <212> TYPE: PRT 



302 <213> ORGANISM: 
304 <400> SEQUENCE : 



Glycine max 
6 



305 


Met 


Ala 


Ser 


Thr 


Ala 


Thr 


Asn 


Phe 


Ser 


Leu 


Ser 


Leu 


Cys 


Ala 


Asn 


Gin 


306 


1 








5 










10 










15 




308 


Gin 


Thr 


Pro 


He 


Pro 


Lys 


Pro 


Ser 


Phe 


Phe 


Ser 


Asn 


Asn 


Asn 


His 


Leu 


309 








20 










25 










30 






311 


His 


Phe 


Asn 


Ser 


Asn 


Asn 


Asn 


Trp 


Ala 


Trp 


Ala 


Ser 


Val 


Ser 


Thr 


Ser 
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VERIFICATION SUMMARY DATE: 08/01/2001 

PATENT APPLICATION: US/09/744,099 TIME: 18:20:00 

Input Set : A:\BB1159 SEQ LST.txt 
Output Set: N:\CRF3\08012001\I744099.raw 

L:9 M:270 C: Current Application Number differs, Replaced Application Number 

L:10 M:271 C: Current Filing Date differs, Replaced Current Filing Date 

L:13 M:256 W: Invalid Numeric Header Field, Wrong Prior FILING DATE : YYYY-MM-DD 
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